Search results for "record linkage"
showing 10 items of 20 documents
Active learning strategies for the deduplication of electronic patient data using classification trees.
2012
Highlights: • Active learning for medical record linkage is used on a large data set. • We compare a simple active learning strategy with a more sophisticated variant. • The active learning method of Sarawagi and Bhamidipaty (2002) [6] is extended. • We deliver insights into the variations of the results due to random sampling in the active learning strategies. Introduction: Supervised record linkage methods often require a clerical review to gain informative training data. Active learning means actively prompting the user to label data with special characteristics in order to minimise the review costs. We conducted an empirical evaluation to investigate whether…
The Long-Term Effects of Physical Loading and Exercise Lifestyles on Back-Related Symptoms, Disability, and Spinal Pathology Among Men
1995
Study design: Historical cohort, including selected subgroups. Objectives: To understand the long-term effects of exercise on back-related outcomes, back pain, sciatica, back-related hospitalizations, pensions, and magnetic resonance imaging findings were studied among former elite athletes. Summary of background data: Exercise and sports participation have become increasingly popular, as have recommendations of exercises for back problems, but little is known about their long-term effects. Methods: Questionnaires were returned by 937 former elite athletes and 620 control subjects (83% response rate). Identification codes allowed record linkage to hospital discharge and pension registers. Magne…
A Cohort Study of Childhood Cancer Incidence after Postnatal Diagnostic X-Ray Exposure
2009
Ionizing radiation is an established cause of cancer, yet little is known about the health effects of doses from diagnostic examinations in children. The risk of childhood cancer was studied in a cohort of 92,957 children who had been examined with diagnostic X-rays in a large German hospital during 1976-2003. Radiation doses were reconstructed using the individual dose area product and other exposure parameters, together with conversion coefficients developed specifically for the medical devices and standards used at the radiology department. Newly diagnosed cancers occurring between 1980 and 2006 were determined through record linkage to the German Childhood Cancer Registry. The median ra…
Determinants of homonym and synonym rates of record linkage in disease registration.
1996
Abstract: Reliable record linkage is an essential component of the quality of population-based disease registration. Quality assessment of disease registries should, therefore, include quantitative approaches to describe the extent of record-linkage errors. The homonym and synonym rates have been proposed for this purpose. The homonym rate quantifies the proportion of distinct patients excluded from registration due to erroneous linkage with other patients. The synonym rate quantifies the proportion of unrecognized duplicate notifications on patients already registered in the registry. This paper provides an algebraic assessment of the determinants of both rates. It is shown how the homonym a…
Effects of record linkage errors on disease registration
1998
Abstract: Reliable record linkage is a prerequisite for high-quality population-based disease registration. Rapid developments in computer processing have made record linkage both more efficient and more reliable in recent years. At the same time, concerns about confidentiality increasingly hinder record linkage in many disease registries. This paper provides basic algebraic models describing the effects of record linkage errors on monitoring disease incidence. Homonym errors, that is, erroneous linkage of records that pertain to distinct individuals, lead to underestimation of incidence in the registry population. The degree of underestimation strongly depends on the discriminating power of…
Evaluation of Record Linkage Methods for Iterative Insertions
2009
Summary. Objectives: There have been many developments and applications of mathematical methods in the context of record linkage as one area of interdisciplinary research efforts. However, comparative evaluations of record linkage methods are still underrepresented. In this paper improvements of the Fellegi-Sunter model are compared with other elaborated classification methods in order to direct further research endeavors to the most promising methodologies. Methods: The task of linking records can be viewed as a special form of object identification. We consider several non-stochastic methods and procedures for the record linkage task in addition to the Fellegi-Sunter model and perform an e…
Missing values in deduplication of electronic patient data
2011
Data deduplication refers to the process in which records referring to the same real-world entities are detected in datasets such that duplicated records can be eliminated. The denotation ‘record linkage’ is used here for the same problem [1]. A typical application is the deduplication of medical registry data [2, 3]. Medical registries are institutions that collect medical and personal data in a standardized and comprehensive way. The primary aims are the creation of a pool of patients eligible for clinical or epidemiological studies and the computation of certain indices such as the incidence in order to oversee the development of diseases. The latter task in particular requires a database in wh…
Controlling false match rates in record linkage using extreme value theory
2011
Abstract: Cleansing data from synonyms and homonyms is a relevant task in fields where high quality of data is crucial, for example in disease registries and medical research networks. Record linkage provides methods for minimizing synonym and homonym errors, thereby improving data quality. We focus our attention on the case of homonym errors (in the following denoted as ‘false matches’), in which records belonging to different entities are wrongly classified as equal. Synonym errors (‘false non-matches’) occur when a single entity maps to multiple records in the linkage result. They are not considered in this study because in our application domain they are not as crucial as false matches. Fa…
Work incapacity among family caregivers: a record linkage study
2022
Background: Family caregiving-related physical and mental health problems may lead to work incapacity in employed caregivers. The aim of this study was to quantify sickness absences and disability pensions (SADP) among high-intensity family caregivers available to the labour market compared with a control population. Methods: The study sample included all individuals in Finland who had received caregiver’s allowance and were available to the labour market in 2012 (n=16 982) and their controls (n=35 371). Information on the number of sickness absence (spells >10 days) and disability pension (SADP) days and related diagnoses according to ICD-10 were obtained from national registers for the yea…
A practical framework for data management processes and their evaluation in population-based medical registries.
2013
We present a framework for data management processes in population-based medical registries. Existing guidelines lack the concreteness we deem necessary for them to be of practical use, especially concerning the establishment of new registries. Therefore, we propose adjustments and concretisations with regard to data quality, data privacy, data security and registry purposes. First, we separately elaborate on the issues to be included in the framework and present proposals for their improvement. Thereafter, we provide a framework for medical registries based on quasi-standard-operation procedures. The main result is a concise and scientifically based framework that tries to be both broad a…